Measuring Semantic Similarities to Classify Instances for Ontology Learning
نویسندگان
چکیده
Ontology learning refers to generating scalable ontologies based on the web of documents. It includes two main processes: first, extracting concepts and their semantic relationships; second, classifying instances based on the extracted concepts. In this paper, we proposed an effective approach called Max Similarity Min Distance Algorithm (MSMDA) to address the second process. Traditionally, the instances are classified by a classifier, which is trained by a group of samples. The training process is time-consuming and labor-intensive. To avoid the training process, our approach uses Normalized Compression Distance (NCD) to measure the similarities to classify instances. We improved the traditional NCD algorithm by calculating the semantic similarities between instances. We conducted the empirical experiments based on two datasets. The experiment results indicate that MSMDA outperforms the traditional NCD algorithm in terms of
منابع مشابه
Public Transport Ontology for Passenger Information Retrieval
Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...
متن کاملPartial Multi-dividing Ontology Learning Algorithm
As an effective data representation, storage, management, calculation and analysis model, ontology has been attracted more and more attention by scholars and been applied to various engineering disciplines. In the background of big data, the ontology is expected to increase the amount of data information, and the structure of its corresponding ontology graph has become complicated. Thus, it dem...
متن کامل3D Model Retrieval Based on Semantic and Shape Indexes
Classifying 3D models into classes is an important step in 3D model retrieval process. However, most classification system does not include semantic information. In this paper, a new method has been proposed to classify and to retrieve 3D model using semantic concepts and ontology. First, we use the machine learning methods to label 3D models by k-means algorithm in shape indexes space. Second,...
متن کاملExtract Features from WSDL Documents to Cluster Web Services with Ontology Learning
Web service discovery is becoming a challenging and time consuming task due to large number of Web services available on the Internet. Organizing the Web services into functionally similar clusters is one of a very efficient approach for reducing the search space. To cluster Web services, take out the Web services description languages documents and extract the features (e.g., service name) to ...
متن کاملBi-directional semantic similarity for gene ontology to optimize biological and clinical analyses
BACKGROUND Semantic similarity analysis facilitates automated semantic explanations of biological and clinical data annotated by biomedical ontologies. Gene ontology (GO) has become one of the most important biomedical ontologies with a set of controlled vocabularies, providing rich semantic annotations for genes and molecular phenotypes for diseases. Current methods for measuring GO semantic s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010